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Abstract 

Employing profits data of Japanese companies in 2002 and 2003, we confirm that Pareto's 
law and the Pareto index are derived from the law of detailed balance and Gibrat's law. 
The last two laws are observed beyond the region where Pareto's law holds. By classifying 
companies into job categories, we find that companies in a small scale job category have 
more possibilities of growing than those in a large scale job category. This kinematically 
explains that the Pareto index for the companies in the small scale job class is larger than 
that for the companies in the large scale job class. 
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1 Introduction 

The pioneering discovery by Pareto [1] is not only an important cue for research in fractal 
but also a significant issue in economics. The Pareto law states that a cumulative number 
distribution iVp(> x) obeys power-law for income x which is larger than a certain observational 
threshold Xo: 

Np(> x) oc for x > xo, (1) 

where the exponent \x is called Pareto index. Recently Pareto's law is checked with high accuracy 
by using digitalized data [2]. Although the number of persons or companies in the range where 
Pareto's law (1) holds is a few percent, the amount of the income occupies a large part of the total 
income. High income persons or high income companies can, therefore, influence economics. It 
is quite important to investigate the mechanism which governs them. 

The research on the income distribution is well investigated in econophysics [3]. Many mod- 
els to explain the distribution are proposed (for instance recently Refs. [4, 5, 6, 7]). Furthermore, 
Fujiwara et al. [8] find that, without assuming any model, Pareto's law can be derived kinemat- 
ically from the law of detailed balance and Gibrat's law [9] which are observed in high quality 
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digitalized data. They derive not only Pareto's law but also the reflection law which correlates 
a positive growth rate distribution to negative one by the Pareto index fi. This means that 
Pareto's law and index are explained by the law of detailed balance and Gibrat's law. We 
should note that these arguments are limited in the high income region (x > Xo). 

The findings in Ref. [8] are quite fascinating especially in the idea that the Pareto index can 
be understood by the growth rate distribution. In this paper, we first investigate the application 
as follows. The distribution of high income companies follows Pareto's law and the Pareto index 
is about 1. This is called Zipf's law [10]. It is also known that the distributions in most job 
categories follow Pareto's law. Those Pareto indices, however, scatter around 1 [5, 11, 12]. We 
examine the reflection law that relates the change of the Pareto index to the change of the 
growth rate distribution in job categories. 

There is, however, a problem in estimating the Pareto index by the growth rate distribution 
in each job category. The number of companies in several job classes is not enough to observe 
the reflection law. We clear this difficulty by employing profits data which have no threshold in 
contrast to high income data announced publicly. 

By observing all positive profits data, we confirm the law of detailed balance. We also find 
that Gibrat's law holds in the region where the only initial profits are conditioned to be larger 
than the threshold. In this case, Pareto's law is also obtained and the reflection law can be 
observed in companies data classified into job categories. Using these arguments, we confirm 
the correlation between the Pareto index and the growth rate distribution. 

In addition, we find that companies in a small scale job category have more possibilities of 
growing profits than those in a large scale job category. This kinematically explains that the 
Pareto index for the companies in the small job class is larger than that for the companies in 
the large scale job class, which is reported in Ref. [12]. 

2 Pareto's law derived from the detailed balance and Gibrat's 
law 

In this section, we briefly review the proof that the detailed balance and Gibrat's law lead to 
Pareto's law and the reflection law [8]. 

Let the income at two successive points in time be denoted by x\ and x 2 - Growth rate R is 
defined as the ratio R = x 2 /x\. The detailed balance and Gibrat's law can be summarized as 
follows. 

• Detailed balance (Time-reversal symmetry) 

The joint probability distribution function (pdf) P\ 2 {x\,x 2 ) is symmetric: 

Pi 2 (xi,x 2 ) = Pl2(x 2 ,Xi). (2) 

• Gibrat's law 

The conditional probability distribution of growth rate Q{R\x\) is independent of the 
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initial value x±: 

Q(R\x 1 )=Q(R), (3) 
where Q(R\xi) is defined by using the pdf Pi(x±) and the joint pdf P\r(x\, R) as 

Q{R]X1) = M^a, (4) 

These phenomenological properties, for example, are observed in the data of high income 
Japanese companies in 2002 and 2003. In Japan, companies having annual income more than 
40 million yen are announced publicly as "high income companies" every year. The database 
is published by Diamond Inc. In Fig. 1, all the companies are plotted, the income of which in 
2002 (xi) and 2003 (x 2 ) exceeded 40 million yen (xq): 

x\ > xq and x 2 > xq- (5) 

The number of companies which satisfy this condition is 50,632. In Fig. 1, the time-reversal 
symmetry (2) is apparent. In Fig. 2, we confirm Pareto's law (1) for the companies, the income 
of which satisfies the condition (5). The Pareto indices fi are very close to 1, and the case is 
often referred to as Zipf's law [10]. 

We also examine Gibrat's law in the regime (5). Here we divide the range of x\ into logarith- 
mically equal bins as x\ <G 4 x [10 4+a2(n_1 ), io 4+0 - 2n ] with n = 1, • ■ ■ , 5. In Fig. 3, the probability 
density for r is expressed for each bin. In all the cases, all the probability density distributions 
for different n collapse on a single curve approximately. This means that the distribution for the 
growth rate r is approximately independent of the initial value x\. Here the probability density 
for r defined by q(r) is related to that for R by 

log 10 Q(R = 10 r ) + r + log 10 (ln 10) = log 10 q(r). (6) 

Pareto's law (1) can be derived from the detailed balance (2) and Gibrat's law (3) without 
assuming any model [8]. Due to the relation of Pi 2 (xi, x 2 )dxidx 2 = Pir(x\, R)dxidR under the 
change of variables from (x±, X2) to (x\, R), these two joint pdf's are related to each other, 

P 1R (xi,R) = x 1 P 12 (x 1 ,x 2 ). (7) 

By the use of this relation, the detailed balance (2) is rewritten in terms of Pir{x\, R) as follows: 

P 1R ( Xl ,R) = R- 1 P 1R (x 2 , R' 1 ). (8) 

Substituting the joint pdf Pir{x\,R) for the conditional probability Q(R\x\) defined in Eq. (4), 
the detailed balance is expressed as 

Pi{xi) _ 1 Q(i?- 1 |x2) 



Pi(s 2 ) R Q(R\xi) 

From the detailed balance and Gibrat's law (3), one finds the following: 

Pi(ai) _ 1 QjR' 1 ) 
Pi(x 2 ) R Q(R) ' 



(9) 



(10) 
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By expanding this equation around R = 1, the following differential equation is obtained 



G'(l)P 1 (x)+xPl(x) = 0, (If) 

where x denotes x\ and G(R) is defined as Q(R^ 1 )/(R Q(R)). The solution is given by 

P 1 {x)=Cx'^ 1 (12) 

with G'(l) = fjb+1. This is Pareto's law, which leads to the cumulative representation in Eq. (1) 
as follows: N P (> x) = N P (> x )P 1 (> x) = N P (> xq) dyPi(y) oc x'^. 

From Eqs. (12) and (10), one finds the relation between positive growth rate (R > 1) and 
negative one (R < 1), 

Q(R) = R-^QiR- 1 ). (13) 

In terms of q(r) in Eq. (6), this is written as 

logio q(r) = -fir + log 10 q(-r). (14) 

In Fig. 3, we examine this relation, which is called the 'Reflection law' in Ref. [8]. By the use of 
the reflection law to the best-fit line [5] 

log 10 q(r) = c — t+ r for r > 0, (15) 

we obtain the line 

log 10 q(r) = c + t- r for r < (16) 

with 

H = t+-t-, (17) 
where fi ~ 1. The reflected line (16) seems to match the data in Fig. 3 [8]. 



3 Pareto index induced from the growth rate distribution 

From the argument in the preceding section, we expect the estimation of the Pareto index 
by using Eq. (17) under the detailed balance and Gibrat's law. There is, however, a difficulty in 
measuring the slope t- of the reflected line (16). The reason is that the data for large negative 
growth, r < 4 + log 10 4 — log 10 xi, are not available due to the limit (5) (Fig. 3). The reflected 
line (16) does not match the total data which are not classified into bins for instance (Fig. 4). 
Indeed, the Pareto index is estimated to be about from the best fit lines (15) for r > and 
(16) for r < 0. 

In order to avoid this problem, we must remove the threshold xq in the region (5). This 
is, however, impossible as long as we deal with the income data announced publicly in Japan. 
We employ profits as another economic quantity similar to the income. Although profits of 
companies are not announced publicly, the data are available on "CD Eyes" published by Tokyo 
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Shoko Research, Ltd. [13] for instance. This database contains financial information of over 250 
thousand companies in Japan. We deal with the companies, the profits of which in 2002 (xi) 
and 2003 (x 2 ) exceeded yen: 

xi > and x 2 > 0. (18) 

In Fig. 5, all the companies which satisfy this condition are plotted, the number of which is 
132, 499. The time-reversal symmetry (2) is apparent even in Fig. 5. 

We desire to examine Gibrat's law without the threshold. The condition is, therefore, em- 
ployed: 

X\ > xq and x 2 > 0, (19) 

where xq is 40 million yen. The number of the companies which satisfy this limit is 28, 644. 
In Fig. 6, Pareto's law (1) for the companies is observed in the region (5), not in (19). Let us 
examine Gibrat's law in the regime (19). We classify the data for the high profits companies, 
X\ > xq, into five bins in the same manner in the preceding section. The result is shown in 
Fig. 7. We confirm Gibrat's law without the forbid region caused by the threshold. 

Under the extended limit (19), we can derive Pareto's law. In the derivation, we should pay 
attention to the region where Gibrat's law holds as follows: 

Q{R\xi) = Q(R) for xi > x and x 2 > 0, (20) 
Q{R~ l \x 2 ) = Q{R~ l ) for xi > and x 2 > x Q . (21) 

In the region (5) where these two conditions are satisfying, Eq. (10) is relevant and Pareto's law 
(12) holds as a result. This is observed in Fig. 6. 

The reflection law (13) is relevant in the region (5) strictly. Here we assume that Eqs. (13) 
and (14) hold approximately near R = 1 (r = 0) even in the region (19), the validity of which 
should be checked against the results. Under this assumption, we evaluate the Pareto index by 
using Eq. (17). In Fig. 8, the Pareto index is estimated to be 1.02 ± 0.02 from the best fit lines 
(15) for r > and (16) for r < for the total data which are not classified into bins. This is 
consistent with the assumption. We should comment that the Pareto index is estimated to be 
about for the companies, the profits of which in 2002 (xi) and 2003 (x 2 ) exceeded xq. 

4 High profits companies in job categories 

It is reported that income distributions in most job categories follow Pareto's law, but those 
Pareto indices scatter around 1 [5, 11, 12]. The same phenomenon is observed in profits dis- 
tributions. In this section, we classify the companies into job categories in order to verify the 
results in the preceding section. 

In Japan, companies are categorized by Japan Standard Industrial Classification (JSIC). 
This industrial classification is composed of four stages, namely divisions, major groups, groups 
and details (industries). The composition includes 14 divisions, 99 major groups, 463 groups 
and 1,322 industries. The classification in the database "CD Eyes" follows JSIC. 



5 



We classify the 28, 644 companies into the 14 divisions, the profits of which satisfy the 
condition (19). Because the number of companies classified into 99 major groups is not enough 
to investigate the growth rate distribution. The name of each division and the number of 
companies are follows, A: Agriculture (84), B: Forestry (1), C: Fisheries (14), D: Mining (3,512), 
E: Construction (1,182), F: Manufacturing (6,428), G: Electricity, Gas, Heat Supply and Water 
(218), H: Transport and Communications (2,300), I: Wholesale, Retail Trade and Eating & 
Drinking Places (8,699), J: Finance and Insurance (1,994), K: Real Estate (293), L: Services 
(3,919), M: Government, N.E.C. (0) and N: Industries Unable to Classify (0). 

In each job division, we measure t+ and t- of the best fit lines (15) and (16) of all the 
companies in the class. The Pareto index can be estimated from the relation (17) by assuming 
the detailed balance and Gibrat's law. Here we have excluded A, B, C, M and N divisions, 
because the number of companies in each division is not enough to observe the growth rate 
distribution. By comparing the Pareto indices estimated by the growth rate with those directly 
measured in the power-law, we reconfirm the relation (17) in Fig. 9. 

After identifying the Pareto index change as the change of the growth rate distribution in job 
categories, we consider an economic quantity which influences the growth rate. In balance-sheet, 
the sum of capital stock and retained earnings is equity capital, and the sum of equity capital 
and debt is assets. It is natural to consider that a company works based on the assets and gets 
profits (~ income). The positive correlation between assets and income is reported in Ref. [11]. 
It is also pointed out that assets and sales correlate positively in the same reference. On the 
other hand, the positive correlation between sales and capital stock is observed in the database 
"CD Eyes". Consequently there is a positive correlation between assets and capital stock 1 . We 
employ, therefore, average "capital stock" as a quantity which statistically characterizes the 
scale of companies in each job category 2 . 

In Ref. [12], it is reported that there is a relation between the average capital stock of high 
income companies and the Pareto index in 99 major groups. The larger the average capital 
becomes, the smaller the Pareto index becomes. The same phenomenon is observed in profits 
distributions (Fig. 10). This means that the Pareto index is related to the scale of companies in 
job categories. 

We can understand the kinematics by the relation between the average capital and the slope 

of growth rate distribution (t + ,t-). In Fig. 11, t+ decreases with the average capital and i_ 

hardly responds to that. The companies in the small scale job category (Mining for instance) 

have more possibilities of growing profits than the companies in the large scale job category 

(Finance and Insurance for instance). On the other hand, the possibility of growing down is 

independent of the scale of the job category for high profits companies, x\ > Xq. As a result, the 

relation (17) explains that the Pareto index for the companies in the small scale job category is 

larger than that for the companies in the large scale job category. 

lr This correlation is probably caused by the Japanese commercial law. In Japan, capital stock is treated as an 
important quantity to protect creditors. 

2 Indeed, the database "CD Eyes" includes no information about equity capital or assets. 
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5 Conclusion 



In this paper, we have first shown that the Pareto index can be estimated by profits growth 
rate distribution of companies in Japan. The point is that the data have no threshold (xq) in 
contrast to the high income data. We find that the law of detailed balance and Gibrat's law 
hold in the extended region [x\ > 0, X2 > 0) and (x± > xo, X2 > 0), respectively. 

By classifying companies into 14 job divisions, we have secondly confirmed the reflection law 
that relates the change of the Pareto index to the change of the growth rate distribution. 

We thirdly find that the companies in the small scale job category have more possibilities 
of growing profits than the companies in the large scale job category by observing the relation 
between the average capital and the slope of growth rate distribution. The relation between the 
average capital and the Pareto index is also verified. Consequently it is kinematically explained 
that the Pareto index for the companies in the small scale job category is larger than that for 
the companies in the large scale job category. 

We have concentrated our attention to the region {x\ > xo, X2 > 0) in order to examine 
Gibrat's law. Beyond this regime, the breakdown of Gibrat's law is studied, the recent works 
about that are done by Stanley's group [14, 15, 16, 17, 18, 19]. In this paper, we find the law of 
detailed balance over the regime where Pareto's law holds. We expect that the distribution in 
the region {x\ < xq, X2 > 0) can be derived kinematically from the law of detailed balance and 
the results found by Stanley's group. This argument will kinematically decide the distribution 
for the middle region where Pareto's law does not hold. 
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Figure 1: The scatter plot of all companies, incomes of which in 2002 {x\) and 2003 (^2) exceeded 
40 million yen (xq), x\ > xq and X2 > xq. 
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Figure 2: Cumulative number distributions Np(xi) and Np(x2) for companies, the income of 
which in 2002 (x\) and 2003 (#2) exceeded xo, x\ > xq and X2 > xo (xq = 4 x 10 4 thousand 
yen). 
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Figure 3: The probability density distribution g(r) of the log income growth rate r = log 10 X2/X1 
from 2002 to 2003. The data points are classified into five bins of the initial income with equal 
magnitude in logarithmic scale, x\ G 4 x [10 4+a2 ( n_1 ), l0 4+a2n ] (n = l,---,5). The data for 
large negative growth, r < 4 + log 10 4 — log 10 xi, are not available because of the threshold, 
x 2 > Xq. 
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Figure 4: The probability density distribution q(r) of the log income growth rate r from 2002 
to 2003 for all companies. 
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Figure 5: The scatter plot of all companies, the profits of which in 2002 {x\) and 2003 (X2) 
exceeded yen, x\ > and X2 > 0. 
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Figure 6: Cumulative number distributions Np(x±) and Np{x2) for companies, the profits of 
which in 2002 (x±) exceeded xq and those in 2003 {X2) exceeded 0, x\ > xq and X2 > (xq = 10 4 
thousand yen). Note that there is no threshold with respect to x-i- 
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Figure 7: The probability density distribution q(r) of the log profits growth rate r = log 10 X2/X1 
from 2002 to 2003. The data points are also classified into five bins of the initial profits with 
equal magnitude in logarithmic scale: x\ <G 4 x [l0 4 +°- 2 ( n ~ 1 ) , io 4+a2n ] ( n = 1, ■ • • ,5). The data 
for large negative growth r are available in this case. 
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Figure 8: The probability density distribution q(r) of the log profit growth rate r from 2002 to 
2003 for all companies. 
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Figure 9: Pareto indices directly measured in the power-law and differences between t+ and 
t- in job categories. In the legend, each alphabet represents the following division, D: Mining, 
E: Construction, F: Manufacturing, G: Electricity, Gas, Heat Supply and Water, H: Transport 
and Communications, I: Wholesale, Retail Trade and Eating & Drinking Places, J: Finance and 
Insurance, K: Real Estate and L: Services. 



13 



1.5 



x 

<D 

■o 
_c 

o 

o 

i— 
CO 
CL 



0.5 



• 


Total 


X 


D 




E 


□ 


F 


■ 


G 


o 


H 


A 


I 

J 


V 


K 


T 


L 



10 4 10 5 10 6 10 7 10 s 

AverageCapital (inthousandyen ) 

Figure 10: The relation between the average capital and the Pareto index in job categories. 
In the legend, each alphabet represents the following division, D: Mining, E: Construction, F: 
Manufacturing, G: Electricity, Gas, Heat Supply and Water, H: Transport and Communications, 
I: Wholesale, Retail Trade and Eating & Drinking Places, J: Finance and Insurance, K: Real 
Estate and L: Services. 
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Figure 11: The relation between the average capital and t+, t- in job categories. One can 
identify the job category of each point by using the legend in Fig. 10. 
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